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Abstract The digital record of the voltage in a radio telescope receiver, after fre- 
quency conversion and sampling at a finite rate, is not a perfect representation of the 
original analog signal. To detect and characterise a transient event with a duration 
comparable to the inverse bandwidth it is necessary to compensate for these effects, 
which modifies the statistics of the signal, making it difficult to determine the signifi- 
cance of a potential detection. We present an analysis of these modified statistics and 
demonstrate them with experimental results from Bedlam, a new digital backend for 
the Parkes radio telescope. 

Keywords Coherent pulse detection • Gaussian noise statistics • Digital radio 
receivers • ADC non-linearity 



1 Introduction 

Typical radio astronomy applications are not sensitive to transitory extreme fluctua- 
tions in the signal voltage from the telescope receiver. For example, when measur- 
ing the spectral intensity of a static astronomical object, mere single-bit sampling 
of the voltage achieves a minimum of 64% of the signal-to-noise ratio for the ideal 
many-bit case, even though th e ability to measure the magnitude of a single sample 



is entirely lost dWeinrebl ll963. sec. III). However, in applications in which a single 



high-magnitude sample is observationally meaningful — implying a signal with time 
structure on the scale of the inverse bandwidth — it becomes necessary to examine 
the single-sample behaviour of the telescope receiver, and to develop an understand- 
ing of its statistics. 
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One such application is the search for nanosecond-scale broad-band radio pulses 
from neutri no-induced particle cascades in the Moon, curren tly being co nducted with 



the WSRT (Buitink et al., 2010), EVLA (Jaeger et al., 2010) and ATCA (James et al 



2010) telescopes. These pulses are inherently very narrow in time, with a detectable 



width approximately equal to the inverse bandwidth resulting from band-limiting 
in the radio receiver. A single such pulse would appear in the voltage recorded by 
the receiver as a transitory excursion — possibly only a single sample — beyond 
the statistical bounds of the background Gaussian noise, here dominated by ther- 
mal radiation from the Moon. If such an event were detected, and a possible origin 
as radio frequency interference (RFI) were excluded, it might indicate the registra- 
tion of an ul tra-high energy neutrino, with significant implications for cosmic ray 



astrophysics (IBeresinskv and Z atsenin. 1969). The confidence of the detection would 
depend on the expected rate of false positives from noise in the receiver, which we 
address in this paper. 

Radio receivers typically convert the incoming signal from its original radio fre- 
quency to a lower intermediate frequency by mixing it with the single-frequency 
output from a local oscillator. This process affects time-averaged properties of the 
signal, such as the spectral intensity, in a predictable fashion, but the effect on the 
shape and height of an individual peak will depend on its phase relative to the local 
oscillator. Since this phase is usually unknown, the original signal cannot be unam- 
biguously reconstructed; to search for peaks in the original signal, it is necessary to 
reconstruct all of its possible states, searching in phase for the maximum peak height. 
In section [2] we i l lustra te this effect, and show that the signal envelope defined by 



Longuet-Higgins ( 1957) reconstructs the maximum possible peak height. 



The fraction of digital samples above some detection threshold can be trivially 
calculated with Gaussian statistics. However, a peak in the original analog signal will 
generally lie between two sampled points, ne ither of which will record its full mag- 
nitude, as first pointed out in this context by iJames et al. I d201(l The values of the 



analog signal between the sampled points, including any such peaks, can be recon- 
structed through convolution with a sine function as a consequence of the Nyquist- 
Shannon sampling theorem. The number of e xpected excursions beyond some thresh- 



old in this interpolated signal was derived by iRicd d 1 944 1 1 9451) : in section[3]we ex- 
plain the application of this result and extend it to find the excursion rate for the signal 
envelope. 

We test the above results through simulation and experiment. In section|4]we per- 
form Monte Carlo simulations of the excursion rates for noise with various spectra, 
finding them to be consistent with the analytic results. In section [5] we present the 
results of an experiment in which a noise source was connected to "Bedlam" — a 
new digital backend for the Parkes radio telescope — which recorded extreme fluctu- 
ations in the signal voltage for a period of 20 days. Bedlam was designed and built for 
LUNASKA (Lunar Ultra-high energy Neutrino Astrophysics with the S quare Kilo 



metre Array), a neutrino detection project of the type described above (Bra y et al 



2012). The experiment described here allows us to test its consistency with analytic 



and simulated models of the noise behaviour, and to calibrate any variation. 
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2 Frequency mixing 

Shifting a signal in frequency by mixing it with a local oscillator signal, as in a typical 
radio receiver, also changes the phase of the signal. To find a possible peak in the 
original signal, it is necessary to search through the possible values of this phase 
to find the maximum peak height. Here we illustrate this effect, and show that the 
phase-searched maximum signal can be represented by an analytic envelope of the 
signal. 



2.1 Origin of the phase ambiguity 

The mixing stage in a radio receiver can be represented as a multiplication between 
the original radio frequency signal f RF (t) and a local oscillator signal /lo(0' equiv- 
alent to a convolution .Frf(v) *F l0 (v) in the frequency domain. The local oscillator 
and its Fourier transform F L0 (v) are given by 

f L0 (t) =COs(27TV LO f + 0) 

^lo(v) - l - e<* S(v - v L0 ) + \e-* S(v + v L0 ) ! ' ' 

where describes the phase of the local oscillator relative to the signal. After subse- 
quent bandpass filtering to obtain the low-frequency component, the resulting inter- 
mediate frequency signal fw(t) is described, in the frequency domain, as: 

F lF {v) = ^F RF (v-v L0 ) + ^e-^F RF (v + v L0 ) |v|<v L0 . (2) 

For a low-side local oscillator (i.e. v L0 is less than all frequencies v present in the 
radio frequency signal), this becomes 

Fif(v ) = I e -^i g „(v) I>rf(v + v lo ) < v < v lo (3) 
2 \f rf (v-v l0 ) -v lo <v<0 

showing that the intermediate frequency signal depends on the phase factor e~ 1 ^ sl § n ( v ) . 
The expression for a high-side local oscillator is similar, but with the sign of re- 
versed; additional mixing stages add additional phases 0\, 02, etc. If the phase of the 
local oscillator is unknown, the effect in these cases is the same: the intermediate 
frequency signal includes an unknown phase in the range — > 2n. 



2.2 Phase searching and the signal envelope 

The effect in the time domain of introducing a variable phase can be to decrease 
the peak amplitude of the pulse, as illustrated in figure Q] As is unknown, it is not 
possible to unambiguously reconstruct the original signal. However, given a post- 
mixing signal f(t) = fw(t), it is possible to reconstruct the different possible states 
of the original signal for different values of . An excursion beyond some threshold 
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in the original signal must appear as a peak of at least that height in one or more of 
these reconstructed signals. We define h(t, 9) as the reconstructed form of /(f) with 
maximum peak height; in terms of their respective Fourier transforms, H(v, 9) and 
F(v), it is 

H(V,8) =e iesisa ^F(v) (4) 

where 9 is chosen to maximise the peak in h(t,9). For 9 = <p this will restore the 
phase of the original signal (cf. equation^, but this is not generally the case. We can 
re-express the reconstructed signal as 

H(v,B) =F(v) cos 9 + G{v) sin0 (5) 

where 

G(v) = isign(v)F(v). (6) 
Equivalently to equation|5J in the time domain, 

A(f,0)=/(f)cos0+g(f)sin0. (7) 

Since 9 is chosen to maximise hit, 9), we know that 

2gh(t,6) = 0, (8) 

hence 

wr twe (9) 

and h(t, 9) from equation|7]can be rewritten without 0-dependence as 

h{t) = y/fW+gW, (10) 

where the choice of the positive square root reflects the choice of a positive peak in 
hit). We can express git) by using the convolution theorem to find the time-domain 
equivalent to equation [6] 

g(t) = ^r*m, (11) 

Kt 

which is the Hilbert transform of fit). This makes hjt) the same a s the signal en- 
velope defined and extensively discussed by Longue l>Higginsl dl957h : here we will 
consider only its excursion rate, in sections [331 and [3~4l 
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-2 2 

Time (sampling intervals) 

Fig. 1 A coherent, zero-phase, flat-spectrum pulse (i.e. a sine function) against a background of Gaussian 
random noise with the same spectrum (solid). Adding a variable phase <j> to the signal, as per equationfj] 
results in a range of possible pulse shapes, some of which are shown (dotted), with a decreased peak 
amplitude. These represent different possibilities for the signal /(f) from section IT2l The signal envelope 
h(t) (dash-dotted), from equation fTol recovers the full amplitude of the pulse. 



3 Excursion rates 



The significance of an event detected in a stochastic signal is dependent upon the 
expected rate of false detections, given the characteristics of the signal. Here we con- 
sider the case of pulse-like events, involving an excursion beyond some high thresh- 
old, in a Gaussian random signal with an arbitrary spectrum, and find the expected 
rate of such excursions. 



We give the excursion rate for a raw digital signal (i.e. after sampling, but before 
further digital processing) in section 13.11 and after perfect interpolation (i.e. for a 
continuous signal) in section l3~2l These results are known, but the working provides 
a useful foundation for sections 1331 and I3~4l in which we derive the equivalent results 
for the signal envelope (as described in section l2~2l . 



The excursion rates giv en here are one-s ided, beyond a positive threshold only, 
following the convention of lRice ( 1944 , 1945). For a Gaussian signal, the two-sided 
excursion rates will be exactly twice these values. The signal envelope is always 
positive, so for it the distinction is unimportant. 
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3.1 Raw signal 

Immediately after sampling, a signal such as the measured voltage in a radio receiver 
consists of a series of Gaussian samples /(f,) with probability density 

P {f) = -±—e-f 2 ^ 2 (12) 
GV2n 

where a is the standard deviation or root mean square (RMS) signal amplitude, and 
the mean is assumed to be zero. An excursion in the signal consists of one or more 
consecutive samples above some threshold / t h r . If / t hr is sufficiently high that it is 
rare for two consecutive samples to exceed it, the probability that a particular sample 
exceeds the threshold gives us the expected rate of such excursions, 



excursion rate = 




in units of the sampling rate, with erf being the standard Gauss error function. How- 
ever, if the signal samples are strongly correlated in time — as for an oversampled 
signal, or after interpolation — a single excursion will typically involve a number of 
consecutive samples, so this approach is insufficient. 



3.2 Interpolated signal 

After interpolation to produce a continuous signal, the excursion rate is found by 
means of Rice's formula, which gives the rate of upward crossings — and, hence, 
distinct excursions — over a threshold ffa for a signal / = f(t), as 

excursion rate = f df' f P(/thr,/') (14) 
Jo 

where /' is the time derivative of /, an d P(f , f') is their joint probabili ty distribution . 



An early form of this result appears in Rice ( 19451 eqn 3.3-5); refer to RainaJ dl988h 



for a detailed discussion. P(f,f r ) can be rewritten as 

P(f,f')=P(f)P(f'-J) (15) 

where P(f) is the distribution of / (from equation[T2li and P (/';/) is the distribution 
of /' conditional upon the value of /. To find this latter distribution, we consider 
points at closely separated times t and t + dt. If /(f) is a correlated Gaussian signal, 
the n the cor r espon ding values / and f + df follow a joint normal distribution (see 
e.g. iKennevI dl939i ch. 4)): given the value of /, / + df is normally distributed with 
expected value and variance 

E[f + df]=fp(dt) 

i / i, (lo) 

V ar [f + df} = a 2 (l-p(dt) 2 ) 
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where p{dt) is the value of the autocorrelation function (Fourier transform of the 
power spectrum, normalised to p(0) = 1) at separation dt. The slope /' = df /dt is 
then also normally distributed, with expected value and variance 

at 

Taking the limit dt — > and the expressions from equation[l6] and noting that p'(0) = 
0, we obtain 

E[f'}= 
Var[/] = -cr 2 p"(0) 
which represents the distribution 

P{f'-J)=P{f) irrespective of/ 

= *~ f /2g (19) 



(18) 



qV2n 
where 



g = v/VarLf] 



= oy/-p"(0). (20) 

With the expressions for P(f) and P (/';/) from equations [T2l and [T9l respectively, 
equationfT4l gives us 



excursion rate = — J-p"(0)e f ^ la (21) 

27T 

as found by iRicd ( 19451 eqns 3.3-11 & 3.6-1 1). For discrete data, as in section[5]of 
this work, p (f ) can be obtained numerically from the measured power spectrum. 

3.2.1 White noise 

When the power spectrum is known analytically, equation|2T|can be simplified. Con- 
sider the special case of white noise, with constant power spectral intensity from 
frequency v a to Vb, i.e. 

/(v) = { 1 V «<I V I <V « (22) 
I otherwise. 

This function has the Fourier transform 

(0 = 2v 6 sinc(2v 6 -2v a sinc(2v a *) (23) 

where sinc(jc) = sin(nx)/ (%x). Normalising this, we obtain the autocorrelation func- 
tion 

_ v fo sinc(2v j7 f)-v„sinc(2v a f) 
P[) v b -v a ■ (24) 
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The second derivative of this function at t = is 



p"(0) 

which we insert into equation [2J] to obtain 



vf, - Vi 



r 2 v a 



— K 

3 V a - V h 



(25) 



excursion rate = 



3 V a - V b 



(26) 



matching the result found for this case by Rice (1945, eqns 3.3-12 & 3.6-1 1). 



3.3 Signal envelope 

The signal envelope h(t), per equation [TOl is a function of the signal /(f) and its 
Hilbert transform g(t). We take both f — f(t) and g = g(t ) to be normally distributed 
with standard deviation a. The distribution of h = h(t) depends on the degree of 
correlation between them. Given the value of /(?), the expected value of the signal at 
other times is 

E[/(»-T)]=/(f)p(T) (27) 



where p (t) is the autocorrelation function as in section [3T2l The expected value of g, 
from equation [TT] is then 

J nx 
J m 

= as — is odd, and p(r) is even. (28) 

Therefore there is no correlation between / and g; and, from equation[lOl h follows 
a Rayleigh distribution 

P (h) = JL e -i> 2 /2° 2 (29) 

<7 Z 



as found by Longuet-Higginsl ( 1957h . If consecutive values ft(f,-) are not significantly 



correlated, and we choose a threshold sufficiently high that consecutive values 
are unlikely to exceed it (as in section 13.11 ), this gives us the excursion rate of the 
signal envelope: 

excursion rate = / dhP(h) 

= e- h l<' 2a2 (30) 



in units of the sampling rate. 



Noise statistics in the Parkes Bedlam backend 



9 



3.4 Interpolated signal envelope 

To find the excursion rate for the interpolated signal envelope h, we apply the ap- 
proach from section [3~2l which requires us to find the distribution of its time deriva- 
tive b! . From equationlTOl 

h' = \{ff+gg'). (31) 

The distribution of this value depends on the correlation between its components. We 
know from section [3~2l that /' is normally distributed with mean and variance g 2 , 
and uncorrelated with /; the same applies to g' with respect to g. We also know from 
section [3~3l that / and g are uncorrelated. We now consider the correlation between / 
and g' , the latter of which is found as 

g'(t) = j f (i*f(*)j from equationE] 

= -i */(')• ( 32 ) 
Hence the conditional expected value of g', given /, is 

E[g'}=- |rf T -L E [/( ? -T)] 

dx — 7rf{t)p{x) from equation [27] 

ItX 

= a-f (33) 
a 



where 



a = — [ di-^pix) 
q J tit 2 

-l r . i 



dx — ~p(T) from equationl20l (34) 

KX 1 - 



is the coefficient of correlation between / and g'. The variance of g' is then 

Var[s'] = g 2 (l-a 2 ) (35) 

conditional upon the value of /. 

We consider the correlation between g and /' in a similar manner. First we note, 
from equation[TT] that we can obtain / via the inverse Hilbert transform 

/(') = "**(')• ( 36 ) 
%t 



Hence 



,;««(<) (37) 
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and the conditional expected value, given g, is 



(38) 



The conditional variance, Var[/'], is identical to Var[g'] in equation[35] as it is insen- 
sitive to the sign of a. 

We know that h', given / and g, is the sum of two independent normal variables 
(corresponding to the terms in equation [3T1). and hence normally distributed itself. 
We find its expected value and variance: 



E[h'} = ~(fE[f}+gE[g'}) 



Vai^] = ^(/ 2 Var[/']+g 2 VaiV]) 
f+F 2 



h 2 

g 2 (l-a 



Var[/'] 



from equations 1331 and [381 (39) 



as Var[/'] = Var[g'] 
from equations [TOl and 1331 



(40) 



The distribution of h! is thus 



P(h';h)=P{ti) 



1 



g h V2n 



irrespective of h 



(41) 



where 



ft = VVar[A'] 
= ffV-p"(0)v / l-a 2 



from equations l20l andl40l 



(42) 



With this distribution, and the distribution P(h) from equation|29] we can determine 
the excursion rate beyond a threshold from equation[l4l finding that 



excursion rate = 



2k 



-P"(0)-(/*tJL P (t)) h J± e -*l/^ 



(43) 



If the integral in this expression is to be evaluated with numerical data, with values 
of p (f ) calculated for discrete time intervals, note that the analytic result 



KE 



(44) 



should be used for the interval — e < z < £ to avoid the singular behaviour at z = 0. 
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3.4.1 White noise 

Here we consider white noise from frequency v„ to Vf, as in section [J. 2. II and derive 
the excursion rate for the signal envelope in this case. The autocorrelation function 
for white noise is given by equation [24] which allows us to evaluate the integral in 
equation |431 as 



Also incorporating p"(0) from equation|25j we can evaluate the excursion rate from 
equationl43lto be 



4 Monte Carlo simulation 

The results from section[3]can be tested through Monte Carlo simulation: generating 
random noise data and comparing the distribution of peak heights (for the signal and 
its envelope, with and without interpolation) with the analytic predictions. This also 
provides a helpful illustration of the effects of interpolation and the phase search 
represented by the signal envelope on the excursion rate. 

4. 1 Uncorrelated noise 

The simplest case to test through simulation is that of uncorrelated noise. We gener- 
ated 100,000 arrays, each containing 4,096 independent normally-distributed random 
values. For each array, we performed the following steps: 

1 . Find the maximum value in the array. 

2. Perform interpolation on the array, and find the maximum interpolated value. 

3. Take the norm of the array and its Hilbert transform to find the signal envelope, 
as in equation[lO] Find the maximum value in the resulting array. 

4. Repeat step [3] with an interpolated version of the array, and find the maximum 



The interpolation in steps [2] and [4] was 32-fold, with 31 interpolated values between 
each pair of raw values. This level of precision was chosen as it resulted in maximum 
reconstructed values within 0. 1 % of those resulting from finer searches, while being 
relatively inexpensive to compute. 

The number of excursions beyond some threshold, under each of the conditions 
above (raw values; interpolation; signal envelope; interpolated envelope), can be con- 
sidered to be the number of arrays containing a peak height above that threshold. This 
leads to an underestimate of the excursion rate for low thresholds, for which multiple 
excursions within a single 4,096-value array are common; but it is accurate for high 
thresholds, for which this is rare. 




(45) 




(46) 



value. 
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Threshold peak height (er) 

Fig. 2 Cumulative histograms (solid) of the peak height in an array for white-spectrum Monte Carlo 
data, for the signal and its envelope, with and without interpolation, compared with analytic predictions 
(dashed). The divergence at low thresholds is due to arrays containing multiple peaks, which are counted 
only once. The divergence at high thresholds is due to small-number statistics. 

As the values are uncorrelated, the expected excursion rates at high thresholds for 
the uninterpolated data are given by equation [T3] for the raw signal and equation [30l 
for the signal envelope. Uncorrelated values also result in a white spectrum, with a 
constant spectral intensity up to an implied maximum frequency of 0.5 cycles per 
sampling interval, allowing the use of equations l26l and l46l to predict the excursion 
rates for interpolated data, for the signal and its envelope respectively. In figure [2] 
we compare these excursion rates, in terms of the total expected number of excur- 
sions, with cumulative histograms of the peak height in an array, finding them in 
good agreement. 



4.2 Correlated noise 

To test the dependence of the excursion rate on the autocorrelation of the noise, we 
performed another Monte Carlo simulation with a different spectrum. Arrays of ran- 
dom values were generated with a Gaussian power spectrum 



I(V) - e - v2 /2a v 2 



M<V,/2 



(47) 
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Fig. 3 Cumulative histograms (solid) of the peak height in an array for Gaussian-spectrum Monte Carlo 
data, for the signal and its envelope, with and without interpolation, compared with analytic predictions 
(dashed). Divergence at low and high thresholds, as in figure [2] is due to under-counting of peaks and 
small-number statistics respectively. Analytic predictions for the case of uncorrelated noise (dotted) are 
shown for comparison. 

where o v = V s /5 is defined in terms of the sampling rate v^. Peak heights for the 
signal and its envelope, with and without interpolation, were then found through the 
same procedure as in section |4~T1 

As the values in this case are correlated, the excursion rates for interpolated data 
must be found with equation [2TJ for the signal and equation [43] for its envelope, in 
terms of the autocorrelation function p(f). This function can be derived analytically 
from the power spectrum in equation [47] but was instead obtained numerically in 
order to validate the procedure to be used in section [5] Unlike the uncorrelated data 
in section |4~T1 excursion rates for correlated uninterpolated data are not expected to 
be given exactly by equations [T3l and [30l 

Cumulative histograms of peak heights are compared with these predictions in 
figure [3] The histograms for interpolated data agree closely with the predictions, sig- 
nificantly lower than the excursion rates expected for uncorrelated noise. 

5 Experimental test with the Parkes Bedlam backend 



Bedlam is a digital backend for the Parkes radio telescope, designed and b uilt to 
perform transient pulse detection for the LUNASKA project dBrav et all I2012H . We 
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Fig. 4 Signal path for two active channels in the Parkes Bedlam backend (see section l5TTl for details). In 
normal operation (outer signal paths) it receives input from the Parkes radio telescope; in this experiment, 
it was instead connected to a noise generator (centre). A trigger condition (dashed) causes the contents of 
the buffers to be stored. 



performed an experiment in which Bedlam processed data from a noise source, the 
results of which allow us to test the performance of the hardware and the results from 
section [3] 



5.1 The Parkes Bedlam backend 



In nor mal operation, Bedlam is connected to the Parkes 21 cm multibeam receiver (Stavelev-Smit h et al. 



1996). It has eight inputs, typically receiving the voltage signal from both polarisa 



tions for four of this receiver's thirteen beams. The receiver amplifies the signal and 
downconverts it to a lower frequency (see section lXTT ); it then enters Bedlam where it 
is digitised by analog-to-digital converters (ADCs) at 1,024 Msamples/s (correspond- 
ing to 512 MHz of bandwidth) and 8 bits of precision, and subjected to an adjustable 
dedispersion filter to compensate for ionospheric dispersion. Buffers of the raw and 
dedispersed signals are kept, with an adjustable size up to 8 fls. 

The dedispersed data are then interpolated. Full reconstruction of a Nyquist- 
sampled signal is achieved by convolving with a sine function; interpolation here 
is performed as a convolution with a six-point approximation to a sine function (at 
±0.5, ±1.5 and ±2.5 sampling intervals), producing a single interpolated point be- 
tween every pair of original samples. If a value is found over an adjustable thresh- 
old, in either the original or interpolated values, it triggers the permanent storage of 
both raw and dedispersed buffers for all input channels. Anticoincidence logic can be 
applied between channels to prevent triggering from simultaneous above-threshold 
values in multiple channels, but was inactive for this experiment. 

In this experiment, all eight channels were connected to the same noise source. 
However, thresholds were set such that only two of these were able to trigger buffer 
storage. The signal path for these channels is shown in figure|4] 



5.2 Experiment 

The Bedlam backend was operated with pure noise input for 478 hours, starting on 
23 September 2010. The noise source was a reverse-biased diode in breakdown, with 
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a passive eight-way power-splitter to connect it to all eight input channels. In this 
time, there were 902,1 15 peaks in the noise signal sufficient to trigger storage of the 
buffered data, with the buffer size set to 4 fis (4,096 samples). During the 51 ms 
period taken to store a single set of buffers of this length, the system is unable to 
respond to any further triggers; this dead time amounts to 2.7% of the experimental 
period. The dedispersion filter was set to its minimal setting, equivalent to ~ 2 ns 
differential delay across the band of the noise. 

The RMS noise amplitude was typically ~ 12 analog-to-digital units (ADU), 
compared to a maximum of +127 ADU for an 8-bit signed integer. The tempera- 
ture of the noise source was not controlled, so the gain varied over the course of the 
experiment, resulting in ~ 5% slow variation in the RMS. More dramatic fluctuations 
occurred during a 8-hour period partway through the experiment, possibly due to a 
problem with the power supply, so this interval was excluded. After also excluding 
dead time, the remaining effective duration of the experiment is 458 hours. 

Typical stored buffers consisted mostly of pure noise, with a single extreme value 
that exceeded the trigger threshold in the centre of the buffer, as expected from Gaus- 
sian noise with a relatively wide band. However, there were two atypical events which 
exhibited increased signal power for an extended period, over 100 ns (see figure [5). 
These events are not consistent with pure Gaussian noise; their origin is unknown, but 
more such events were found during the period excluded due to RMS fluctuations, 
which suggests that they may be related phenomena. These two events were excluded 
from further analysis. 

A minor bug resulted in spurious recorded values at the start and end of each 
buffer, so these were excluded from the analysis; the triggering value, at the centre 
of the buffer, is unaffected by this. Peaks in the spectrum at and 512 MHz — the 
bottom and top of the band — were removed, the former being a direct-current offset 
and the latter probably resulting from internal interference from the digital hardware, 
at a harmonic of its clock speed of 256 MHz. Lesser interference at harmonic and 
subharmonic frequencies was considered insignificant and not removed, but is just 
visible in the noise spectrum shown in figure [6] 



5.3 Excursion rates 

Excursion rates for the signal and its envelope, with and without interpolation, were 
found as in section |4] for post-dedispersion buffers for a single channel of Bedlam. 
Although the signal in this case does not undergo frequency mixing, unlike normal 
operation of the Parkes radio telescope (see figure |4), random noise has no preferred 
phase and hence displays the same characteristics with or without this step. For the 
same reason, the dedispersion filter has no effect on the characteristics of the signal 
other than to change its spectrum. The spectrum after filtering is shown in figure|6]and 
the derived autocorrelation, for determining the expected excursion rates, is shown in 
figure 13 

The significance of the peak in each buffer was calculated against a value of <7 
equal to the combined RMS of all buffers within a moving 10-minute window, to re- 
duce the uncertainty associated with the limited number of samples in a single buffer. 
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Fig. 5 The two most significant peaks, after dedispersion, recorded over the course of the experiment. 
The upper peak displays the narrow profile typical of rare fluctuations in Gaussian noise. The lower peak 
includes an extended elevation in signal power, possibly indicative of an instrumental problem, and is one 
of two such peaks excluded from the analysis (see text). In both cases, similar peaks are seen in the other 
channels. 



The central portion of each buffer, containing the triggering value, was excluded from 
the RMS calculation to avoid bias. The RMS variation over a 10-minute period was 
~ 0.5%, resulting in a corresponding uncertainty in the peak height. 

Quantisation error in the ADCs of ±0.5 ADU is equivalent to 0.5% at 8(7. Summed 
in quadrature with the uncertainty from RMS variation, above, this gives a 0.7% un- 
certainty in the height of a single peak. This has a negligible direct effect on the mea- 
sured excursion rates for low threshold values, as it is averaged over a large number 
of excursions; it applies in full only to the single highest peak. However, it indirectly 
causes a systematic effect: due to random increases or decreases in the peak height, 
the rate of excursions beyond a particular threshold will be a weighted average of the 
excursion rates that would occur at lower or higher thresholds in the absence of this 
variation. This effect is equivalent to convolving the excursion rate as a function of 
threshold with the random error distribution. Making a Gaussian approximation to 
the latter, we find that 0.7% random error leads to a ~ 0.2% increase in the threshold 
at which a given excursion rate occurs. 

Figure [8] shows the measured excursion rates compared to the predictions from 
equations |2T1 and l43l modified by the above effect. As in section l4~2l (figure l3t. pre- 
dictions for the case of uncorrelated white noise are also shown. The results are in 
reasonable agreement: the difference between the excursion rates for the signal and 
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Fig. 6 Power spectrum of the recorded dedispersion-filtered noise data. The peaks at 128 and 256 MHz 
(marked) are internal interference from the digital signal-processing hardware. 
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Fig. 7 Autocorrelation function p(f) for the recorded noise data around t = 0, derived from the power 
spectrum in figure[6] 
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Threshold peak height (er) 

Fig. 8 Measured excursion rates (solid) of the peak height in each buffer for experimental data with the 
Parkes Bedlam backend, for the signal and its envelope, with and without interpolation, compared with 
analytic predictions (dashed). At low thresholds, there is a deficit in the excursion rates due to incomplete 
triggering. At high thresholds, there is divergence due to random error and small-number statistics. The 
systematic offset associated with the random error (see text) has already been applied to the predicted 
curves; we ascribe the remaining consistent excess at intermediate thresholds to non-linearity, which is 
measured as shown and discussed in section 15.41 Analytic predictions for the case of uncorrelated noise 
(dotted) are shown for comparison. 

for its envelope is clearly visible, although the increased error in the experimental 
data relative to the Monte Carlo simulation makes it impossible to clearly discrimi- 
nate between the predictions for uncorrelated noise and for the actual spectrum. 

The deficit in the excursion rate at low thresholds is not caused by multiple ex- 
cursions in a single array, as in the Monte Carlo data, but by the Bedlam backend 
failing to trigger on some lower-amplitude peaks. As the triggering algorithm acts 
only on the signal (rather than its envelope), and performs only partial interpolation, 
a peak which is of low amplitude in the raw data but high amplitude after interpo- 
lation or in the envelope will be missed; consequently, the deficit extends to higher 
thresholds for interpolated and envelope data. There is divergence at high thresholds 
from small-number statistics, as for the Monte Carlo data, but with an additional 
contribution from the random error described above. 

The remaining discrepancy in figure [8]is the small systematic excess at interme- 
diate thresholds, most clearly visible for the raw and interpolated data. Measuring 
this excess relative to the theoretical excursion rate for interpolated data at 7.5(7, as 
a compromise between uncertainty from incomplete triggering (at lower thresholds) 
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and from small-number statistics (at higher thresholds), this is equivalent to a 0.4% 
offset in the threshold. The number of excursions beyond this threshold is 278, which 
is sufficient to make the averaged random error in the peak height negligible, but there 
is a remaining effect from small-number statistics: the Poisson error of ±1/ V278 in 
the number of excursions results in a ±0.1% uncertainty in the measurement of this 
offset. We discuss the cause of this offset in the next section. 



5.4 Non-linearity 

In an ideal ADC, there is a direct linear relationship between the thresholds in the 
analog input voltage required to produce particular digital output codes, and the val- 
ues of those output codes (in ADU). In practice, there are minor offsets between these 
thresholds and their ideal values. The maximum such offset for a given ADC is its 
integral non-linearity ( INL); for the hardware used in this experiment, this is adver- 
tised as 0.4 ADU de2v|,[2010). This non-linearity affects measured peak heights, and 
hence excursion rates, through two different mechanisms. 

The first effect is a direct error in the measurement of the peak height, with a 
magnitude less than or equal to the INL. For this experiment, an INL of 0.4 ADU 
leads to a possible 0.4% error for peaks at 7.5a. The direction and magnitude of the 
error may vary across the voltage range of the ADC, controlled by the difference in 
the non-linearity between adjacent voltage thresholds (the differential non-linearity, 
or DNL). In general, we expect the error for peaks in the positive direction (around 
+ 100 ADU) to be uncorrelated with the error for peaks in the negative direction 
(around -100 ADU). 

The second effect arises from mismeasurement of the RMS, which leads to an 
error in the normalised peak height (measured in a). This effect is dominated by the 
non-linearity in voltage thresholds within 1-2(7 (12-24 ADU) of zero voltage, within 
which most sample values lie. The magnitude of this effect is difficult to predict, as 
it depends strongly on the variation of the non-linearity across this range, but we can 
place an approximate upper bound of the INL (0.4 ADU) divided by the RMS, or 
~ 3%, corresponding to minimal variation. Because the same RMS value is used to 
normalise both positive and negative peak heights, we expect this error to apply to 
measured excursion rates in both directions. 

The offset of 0.4 ±0.1% in the threshold peak height for positive excursions in 
the first input channel of Bedlam, found in section 1531 is within the possible range 
of the above effects. Repeating the analysis for negative excursions, we find an offset 
of 0.9 ±0.1%; the difference between these is explained entirely by the ±0.4% error 
arising directly from the peak height measurement. Note that these offsets are both 
for the interpolated signal, rather than the envelope: as the envelope is defined to be 
positive, it neglects the distinction between positive and negative excursions. 

Similar results were obtained for all input channels of Bedlam, each of which 
passes through a separate ADC. The results are shown in table [1] and are all within 
the ±3% range expected from RMS measurement error. There is a clear correlation 
between the positive and negative excursion rates for each channel; the two highest 
and two lowest excesses are associated with channels 2A and 4B respectively. This 
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, , non-linearity (%) 
channel 





positive 


negative 


1A 


+0.4 


+0.9 


IB 


+0.3 


+ 1.0 


2A 


+ 1.7 


+2.5 


2B 


+ 1.1 


+ 1.2 


3A 


+0.5 


+0.4 


3B 


+0.8 


+ 1.0 


4A 


+0.8 


+ 1.6 


4B 


-0.4 


-0.3 



Table 1 Offsets in threshold peak height, ascribed to non-linearity, required to produce the observed ex- 
cess for rates of positive and negative excursions for interpolated data in all channels; figure[8]shows the 
measurement for positive excursions in channel 1A. The uncertainty in all measurements is ±0.1%. The 
channel labelling scheme reflects the normal usage of the Bedlam backend, with each channel connected 
to either the A or B polarisation of one of four beams of the Parkes 21 cm multibeam receiver. 



is consistent with the greater part of the common excess originating from error in 
the RMS measurement, while the difference between the excesses for positive and 
negative excursions is caused directly by errors in measurement of the peak height. 
The greatest such difference is 0.8% for channel 2A, which is just compatible with 
independent errors of ±0.4% in measurement of the peak height, combined with 
±0.1% from measurement of the excess in the excursion rate. 



6 Conclusion 

Experiments which search for rare, transient events against a background of Gaussian 
noise, with a time-scale comparable to the inverse bandwidth, require a thorough un- 
derstanding of the noise statistics to evaluate the significance of a possible detection. 
If the signal undergoes a frequency mixing step, a phase ambiguity is introduced, 
and a full search for a possible event requires the computation of the signal envelope. 
We have derived the expected excursion rate from Gaussian noise under these con- 
ditions, and confirmed it through simulation and experiment. We have also described 
and tested Bedlam, a digital backend for the Parkes radio telescope which is designed 
for such an experiment. 
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